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ABSTRACT 

We investigate methods to best estimate the normalisation of the mass density fluc- 
tuation power spectrum (as) using peculiar velocity data from a survey like the Six 
degree Field Galaxy Velocity Survey (6dFGSv). We focus on two potential problems 
(i) biases from nonlinear growth of structure and (ii) the large number of velocities 
in the survey. Simulations of ACDM-like models are used to test the methods. We 
calculate the likelihood from a full covariance matrix of velocities averaged in grid 
cells. This simultaneously reduces the number of data points and smooths out non- 
linearities which tend to dominate on small scales. We show how the averaging can 
be taken into account in the predictions in a practical way, and show the effect of the 
choice of cell size. We find that a cell size can be chosen that significantly reduces 
the nonlinearities without significantly increasing the error bars on cosmological pa- 
rameters. We_c^m^are om^sults wi th those from a prin cipal components analysis 
following Wat kins et all ([2002) and iFeldman et ail (2003) to select a set of optimal 
moments constructed from linear combinations of the peculiar velocities that are least 
sensitive to the nonlinear scales. We conclude that averaging in grid cells performs 
equally well. We find that for a survey such as 6dFGSv we can estimate as with less 
than 3% bias from nonlinearities. The expected error on as after marginalising over 
Q m is approximately 16 percent. 

Key words: large-scale structure of universe - cosmological parameters - surveys - 
galaxies: kinematics and dynamics - galaxies: statistics 



1 INTRODUCTION 

The formation of structure is assumed to have arisen from 
seed perturbations created by quantum fluctuations of a 
scalar field during the inflationary era in the first moment 
of the universe. These tiny initial fluctuations in the den- 
sity field were then amplified by gravitational instability 
into the structure we perceive around us today. This the- 
ory is supported by recent detections of the baryon acoustic 
oscillations in g alaxy power spectra ([Eisenstein et al ] l2005l : 
ICole et aD l2Q05h . Because these initial fluctuations are as- 
sumed to be a Ga ussian random field, as suggested by in- 
flationary theory ((Peebles 1993: iBardeen et alJll986h . their 
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distribution will be fully characterised by their power spec- 
trum. On sufficiently large scales these fluctuations remain 
linear, even up to the present day, therefore we can simply 
relate the fluctuation power spectrum today to that at early 
times. 

The distribution of galaxies in the universe is not likely 
to be the same as the distribution of matter, since most of 
the mass is in the form of indirectly detectable dark matter. 
It is known that galaxies of different types cluster differently 
(|Dresslerlll980l : IConwav et aTll2005h . so they are clearly not 
completely unbiased tracers of the underlying mass, and this 
issue is referred to generically as galaxy biasing. 

Due to its very nature, it is difficult to obtain infor- 
mation on the dark matter distribution. In our local uni- 
verse the most direct method is to observe the velocities of 
galaxies relative to the Hubble flow, which arise from the 
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gravitational pull of the dark matter. The peculiar veloc- 
ity field is a useful tool for probing the matter distribution 
as galaxy velocities are likely to be unbiased traces of the 
matter velocity field, which in turn is simply related to the 
density field in linear theory. Since peculiar velocities are 
a non-local function of the dark matter distribution then 
analysing the peculiar velocity field pr ovides information on 
scales larger than the sampled region (Hoffm an et alJ feoOl) 
as the velocity at a point is determined by the integral over 
the matter distribution in a large volume. 

In practice peculiar velocities are complicated by several 
factors. The major one is that on small scales the density 
field is highly nonlinear; these effects leak into the velocity 
field and cannot be described analytically. A method of ad- 
equately separating the contribution from small scales com- 
pared to that of large scales must be sought, and this is the 
problem we focus on in this paper. 

Another major factor is the accuracy of the peculiar 
velocity measurements. This relies on knowing the distance 
to the galaxy through the distance-redshift relation which 
at low redshift is cz = Hod + v pec , where the redshift z 
is the measured spectroscopic redshift, d is the distance to 
the galaxy and v pec is its radial peculiar velocity. There- 
fore to measure the peculiar velocity the distance to the 
galaxy must first be measured, and this is itself a compli- 
cated task. Relying on the correct calibration of distance 
indicator relations, the calculated distance is a relative dis- 
tance measure which is strongly subject to a number of bi- 
ases and also has very large uncertainties of around 20 per- 
cent, all of which tran slates to the peculiar velocity (see 
IStrauss fe Wil lick 1995, for a review) . The simulations used 
in this paper contain the large statistical error but the effect 
of additional biases is beyond the scope of this paper. 

Velocities are most sensitive to the cosmological pa- 
rameter as, roughly the amplitude of the power spectrum 
at a scale of 8/i _1 Mpc, which is a measure of how clus- 
tered matter in the universe is toda}{3- It is still not well 
constrained by any cosmological probe. The Wilkinson Mi- 
crowave Anisotropy Probe (WMA P) Cosmic Micro wave 
Background (CMB) measurements ([Spergel et al.ll2007l ) rely 
on evolving the anisotropics forward to the present to in- 
fer the value of as, because it is defined at the present 
epoch. This will depend on many parameters that affect the 
growth of structure, such as the mass of the neutrino and 
the amount or type of dark energy. Allowing the mass of the 
neutrino to vary significantly alters the WMAP constraints 
on as (see Table [T]). 

Peculiar velocities provide the only way to measure as 
essentially at redshift zero. Weak lensing, Lyman-a forest 
and cluster measurements are obtained only at higher red- 
shifts, where dark energy was just starting to dominate. See 
Table [1] for some current constraints on as . The range of 
values could possibly lie anywhere between 0.5 and 1. The 
differences in the values of as will of course be in part due 
to the fact that they are from different experiments, and 
different parameters have been marginalised over. Pinning 
down an accurate value of as today could help discrimi- 
nate between different models that affect clustering and the 



t Strictly, it is the rms fluctuation of density in spheres of radius 
8/z _1 Mpc at the present day, in linear theory. 



Table 1. Some recent 68% confidence limits for as using vari- 
ous cosmological probes;: cosmic microwave background (CMB), 
weak lensing (WL), Lyman-a forest (Lya), cluster measurements 
(CL) and supernovae (SN). This shows that as could reasonably 
lie anywhere in the range 0.5 to 1. 



Authors 



Probe 



as result Ref 



WMAP3 (ACDM) 


CMB 


0.76±0.05 


1 


WMAP3 (ACDM+m,) 


CMB 


0.56±0.10 


2 


Rozo et al. 2007 


CL 


0.92±0.10 


3 


Benjamin* et al. 2007 


WL 


0.78±0.05 


4 


Massey* et al. 2007 


WL 


091 +o.09 

-0.07 


5 


Gladders et al. 2007 


CL 


U,U ' -0.13 


6 


Seljak et al. 2006 


Lyai+CMB 
+CL+SN 


0.85±0.02 


7 



* as suming f2 m =0.27 

1. 2:| Spergel et al.l (120071) 3 :lRozo et al.l (l2007h 4: Beniamin et al. 
d2007h 5: iMassev et all (I2007T) 6: Idadders et al.l (|2007h 7: 
ISeliak. Slosar. fc McDonaldl <2006h 



growth of large scale structure such as dark energy models 
and modified gravity. 

The motivation for this paper is the upcoming release of 
the peculiar veloci ty data from the Six Degree Field Galaxy 
Survey (6dFGS, I Jones et al.l \2004 ). 6dFGS has measured 
the redshifts of around 150 000 galaxies over almost the 
entire southern sky, with a subsample of around 12 000 
galaxies having peculiar velocity measurements, an order 
of magnitude larger than any peculiar velocity survey to 
date. All previous peculiar velocity surveys (eg ENEAR , 
Spiral Field I-Band (SFI) fe Mark HI: Ida Costa et aUbOOd: 
i Giovanelli et al.l Il998l : Ida Costa et al.l Il996l : IWillick et al.l 
1 19971 ) have traced the velocity field only out to distances 
of around 7000 kms -1 and suffered because of uneven sky 
coverage and the small number of galaxies. In addition to 
the new 6dFGS data an extended SFI sample (SFI++, 
Masters et al. 2006; So ringob et ai1 l2006) has recently been 
released consisting of around 5000 spiral galaxies. Further- 
more the ever growing Typ e la supernova samples (eg 
Ijha. Riess. fe Kirshnerl 2007) mean that there is a wealth 
of data becoming available for peculiar velocity analysis. 

Results from previous surveys which apply likelihood 
analysis fsee lZaroubi et al.lll997l . l200ll : lFreudling et al.lll999l . 
for Mark III, ENEAR and SFI respectively) seemed to over- 
estimate the combination os^m 6 significantly compared to 
other probes at the time and to the current concordance 
cosmology. Values of as from those analyses were in the 
regio n of 1.7 to 2.4 a fter assumin g Q m — 0.27. S tudies 
by iHoffman fe Zaroubil (|200Cl ) and ISilberman et al.1 <|200lh 
show that this over- estimation may be due to inaccurate 
modelling of the nonlinear part of the power spectrum, the 
small scales. This paper focuses on testing our ability to 
remove the bias from the nonlinear signal. 

In this paper we develop a practical approach in which 
we bin the velocities on a grid and thus erase small scale in- 
formation. Since this reduces the number of data points then 
it also makes a covariance matrix approach computationally 
feasible using the large amount of data from upcoming sur- 
veys. 

The paper is organised as follows. In Section [2] we in- 
troduce the radial peculiar velocity correlation function and 
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likelihood analysis which the two methods used in this pa- 
per utilise in different ways. In Section [3.11 we describe the 
binning method where the galaxies used in this analysis are 
binned on a grid. The effect of the size of the grid cells on 
th e results is shown. In Secti on T3. 2 1 we overv i ew th e method 
of IWatkins et al.1 (|2002h and lFeldman et al.1 (|20Q3h by using 
a more sophisticated principal components analysis (PCA) 
to remove the moments which are most sensitive to the non- 
linear scales. In Section U we describe the mock catalogues 
used in testing the methods. The results are presented in 
Section [5] 



2 PRELIMINARIES 



The a n alysis used in this S ecti on follows Freudling et all 
(Il999h IZaroubi et all (| 19971 ) & IZaroubi et al.1 (|200lh and 
I Kaiser! () 19881 ). We use linear theory to predict the veloc- 
ity correlation function and use a multivariate Gaussian to 
calculate the likelihood. 



2.1 Peculiar velocity correlation function 

To estimate cosmological parameters we compute the radial 
peculiar velocity correlation function (hereafter VCF) from 
linear theory for each parameter set. This forms the basis 
for the prediction which will be compared to the peculiar 
velocity data using the likelihood function, see Section 12.21 
below. The peculiar velocity with observational error can be 
represented as • Ti = v% — s% + e;, therefore the observed 
VCF is defined by 



Rij = (viVj) (siSj) + (aej) 

— &3 



(1) 

(2) 



and the average is over realisations of the universe. The first 
term is the signal VCF and the second term is the contri- 
bution from the errors in the velocity measurements. The 
errors are assumed to be uncorrelated so they only affect 
the diagonal terms in Rij. In linear theory, the signal part 
£ij can be spl i t up i nto perpendicular and parallel compo - 
nents (Gorski Il988l : iGroth. Juszkiewicz. &; Ostrikerl 11989). 
which are scalar functions of r=|r| 



£ij = cos Oi cos | (r) + sin Oi sin Oj^±(r) 



(3) 



where the angles are defined by cos Ox = rx -r and the diag- 
onal elements £u are given by Eq. [5] below. The ^\\(r) and 
*&±(r) can be calculated from the matter power spectrum 
using linear theory and assuming all galaxies are approxi- 
mately at redshift zero 



2tt 2 



P(k)B lu± (kr)dk 



(4) 



where B± — j (x)/x and B\\ = j (x) and jo, jo are the first 
and second derivative of the zeroth order spherical Bessel 
functions respectively, Ho is the Hubble constant and Q m is 
the density of matter in the universe normalised by the crit- 
ical density, f(Qm) is the derivative of the growth function. 
The auto correlation is given by 



I H% f (Sim) 

3 2tt 2 



The dependence on as enters through the normalisation of 
the power spectrum P(k). The dependence on Q m enters 
through f(Q m ) = f2^, where 7 ~ 0.5 57 (found from a fit 
to the growth function at redshift zero, IWang &; Steinhardtl 
(1998)) and through its effect on the shape of the matter 
power spect rum. The power spectrum P(k) is g enerated us- 
ing CAMB ([Lewis. Challinor. & Lasenbvll2000h . The above 
equation for / assumes that the galaxies are at low red- 
shift. The full equation contains the growth rate at the red- 
shift of each galaxy. Peculiar velocity surveys using distance 
indicator relations (as 6dFGS does ) such as t he fundamen- 
tal pl ane ([Diorgovski &; Davislll987h or D n -cr ([Dressier et al.l 
1 19871 ) are unlikely to have data beyond a redshift of 0.05. 
The growth rate increases by less than 1 per cent between 
redshift zero and redshift 0.05 for a flat ACDM model with 
= 0.3, so this is a good approximation for this paper. 
We also use the approximation that the Hubble expansion 
is constant, described simply by a constant expansion rate 
for all galaxies in the survey, when converting between dis- 
tance and redshift in both the simulations and the likelihood 
analysis. 

The two component correlation functions ^\\ and 
are illustrated by the solid lines in Figure [1] respectively. 
They represent the correlations between two galaxies which 
have their separation axis aligned exactly parallel (^||) or 
perpendicular to the line of sight. As expected the cor- 

relation between the velocities of two galaxies decreases with 
increasing separation: the further apart they are the more 
likely different potentials are the main source influencing 
their motion. It is interesting to note the negative correla- 
tion in after scales of about 75/i _1 Mpc. Two galaxies 
separated along the line of sight are likely to be composed 
of one in-falling and the other back-falling into the same 
over-density region, therefore having radial velocities of the 
opposite sign creating the negative correlation. In general 
pairs of galaxies have contributions from each of these func- 
tions, the amount from each dependent on the angle between 
the separation axis and the line of sight, represented in Eq.[3] 
by the sine and cosine functions. 



2.2 Likelihood analysis 

If m is the vector of model parameters, e.g. [ag Ho], and 
d is the vector of N data points, then Bayes' theorem states 
that the posterior probability density of a model given the 
data is 



P(m I d) 



P(m)P(d I m) 



(6) 



P(k)dk. 



(5) 



The denominator is a normalisation constant which we di- 
vide out in our analysis. The probability of the data given 
the model, P(d | m), is the likelihood function which we 
discuss below. 

We may want to find the set of parameters m ax — 
..0n] that maximizes the posterior, and this is equiva- 
lent to finding the set that maximise the likelihood func- 
tion, for the case of a uniform prior probability P(m). To 
calculate the covariance matrix R it is necessary to calculate 
the power spectrum for a given set of cosmological parame- 
ters m fSection l2.ip . Assuming that the peculiar velocities 
and the observational errors are Gaussian random fields the 
likelihood function can be written as 
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Figure 1. Left panel ^n: the correlation between two galaxies which have their separation axis aligned exactly parallel to the line of 
sight. Right panel the correlation between two galaxies which have their separation axis aligned exactly perpendicular to the line of 
sight. The solid line represents ACDM with crs=0.9, the dashed line represents ACDM with as =0.7, the dotted line represents ACDM 
with (78=0.9 and where the functions smoothed on a scale of 25fo _1 Mpc. Note the closer similarity between the functions with the lower 
value of a 8 (dashed) and the higher ctq + smoothing (dotted), than when the values of as are the same but one function is smoothed 
(dotted and solid). 



L = - 1 exp f -i \^ vi (R %• Vj ) . (7) 
V(2tt)"|R| V 2 77 / 

where v% are the observed radial velocities. 



3 METHODOLOGY 

The methods described in this paper both use the pecu- 
liar velocity correlation function and likelihood analysis de- 
scribed above. This Section describes how we apply these 
techniques to the peculiar velocity data in practice. 



3.1 Gridding Method 

The gridding method is a way of averaging together the 
peculiar velocities of spatially close galaxies by laying a 
grid across the survey. Averaging over a number of galaxies 
should allow the linear signal to dominate. It is similar to a 
counts- in- cells approach used in galaxy surveys. We discuss 
a simple way to bin the velocity field and detail a practical 
approach to take account of the binning accurately in the 
VCF. The technique we implement here is designed to be 
simple and fast. 

The likelihood analysis outlined in Section 12.21 above 
uses the individual galaxy peculiar velocities, Vi, as the data. 
Determining cosmological parameters in this way does not 
take account of the nonlinear part of the peculiar velocity 
signal because, as stated above, we make our prediction for 
the VCF based only on linear theory. The density field be- 
comes nonlinear only on small scales, above a wave number 
(k) of about 0.2/i Mpc -1 . 

We lay down a grid across the survey and average to- 
gether all the peculiar velocities within each grid cell so that 



where v' m is the radial peculiar velocity of the cell and e m is 
the error on the velocity of the cell; the angle brackets denote 
an average over all galaxies i within the cell m. Here there 
is the approximation that the grid cells are small enough so 
the radial components to each galaxy within each grid cell 
are parallel. This is tested when we apply the method to 
both linear and nonlinear simulations and we do not see a 
significant problem from this approximation. Note that €i is 
the contribution to the correlation function from the random 
velocity errors of each galaxy and therefore the remaining 
contribution t m to the binned correlation function is reduced 
by the square root of the number of galaxies. 

Averaging the data over a volume of space will essen- 
tially smooth the velocity field which is equivalent to damp- 
ing the small scale contributions. This reduces the observed 
correlations because we average away some of the signal. 
If the data is averaged on a grid and inserted directly into 
the equations in Section [2] without accounting for the aver- 
aging in the correlation function then the cosmological pa- 
rameters will be biased. This is illustrated in Figure [1] The 
smoothed parallel and perpendicular correlation functions 
(dotted lines) are more similar to the parallel and perpendic- 
ular correlation functions with a lower as (dashed lines) than 
the unsmoothed functions with the same as (solid lines). 

This type of binning is then taken account of by mul- 
tiplying the power spectrum in Eq. [4] and [5] with a window 
function corresponding to the size and shape of the grid cell. 
This is because the binning in real space can be written as a 
convolution with a kernel followed by a sampling at the bin 
centres. The convolution kernel W(x) is uniform within a 
bin centered on the origin, is zero outside and normalised to 
have unit integral. In Fourier space this convolution is sim- 
ply a multiplication, and the Fourier space window function 
is the Fourier transform of the real space window function 



= ( v i)iem 



(8) 
(9) 



W(k) = 



8 sin (k x % ) sin sin (fc*§) 

kx ky kz 



(10) 
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where L is the length of a side of a cell in the grid, and the 
angle brackets denote an average over Fourier space direc- 
tions. This means equations that Eq. |4] and [5] become 

*Il,±(r) = Hlf 2 ^ m) J W 2 (k)P(k)B lu± (kr)dk (11) 

tL* = l H ° f 2? m) J W\k)P{k)dk (12) 

where we use the position of the cell centre to calculate all 
the required distances. The corresponding VCF and likeli- 
hood are then formed using the smoothed quantities. 

Unfortunately the method for accounting for the veloci- 
ties described above assumes the data inside each grid cell is 
a continuous field, whereas it is in fact discrete values at the 
locations of the galaxies; we shall refer to this as the sam- 
pling problem. Galaxies trace discrete points of the peculiar 
velocity field, but if enough discrete points are averaged over 
then they will closely approximate averaging over a contin- 
uous distribution. However, some grid cells will not contain 
enough galaxies to provide a reasonable measure of the av- 
erage of the velocity field within that cell, perhaps due to 
masked out areas in the survey or poor sampling in some 
areas. The average radial velocity in a large volume of space 
tends towards zero; conversely when averaging over just a 
few galaxies in one cell the standard deviation of the veloc- 
ities in that cell will be much higher than in well sampled 
cells. If this effect is unaccounted for, the theory prediction 
will have to be larger to match the observed large velocity, 
and thus will bias as high. Since the errors on the observed 
radial peculiar velocities are so large we find that in prac- 
tice this bias is largely concealed by the noise, particularly 
as the error on each cell velocity is weighted by the number 
of galaxies in each cell, see Eq. [9] so under-sampled cells 
are already weighted down by have larger noise than well 
sampled cells. 

If we were to apply a truly accurate correction a sepa- 
rate window function would have to be calculated for every 
grid cell in the survey, and it would depend on the positions 
of the galaxies on the cell. This kind of calculation would be 
difficult and time-consuming. Instead we propse an approx- 
imate approach which we find to be sufficiently accuracte, 
as shown in Section 15.11 

This problem can be effectively corrected by reducing 
the size of the modification to the diagonal elements of the 
VCF, according to the number of galaxies in each cell 

£Corr >•/ , (<^mm <vmm) q\ 

Smm Smm i V-'-'-v 
Tim 

where is the corrected value used to calculate the diag- 
onal elements of the VCF. This correction uses the correct 
value for the diagonal elements of the correlation function 
in the limit that there is just one galaxy in the cell and 
also in the limit that there are infinite galaxies in the cell 
(a continuous field) . For very small numbers of galaxies in a 
large cell £ mm is small compared to £ mm and the velocities 
maybe relatively independent. In this case the diagonal ele- 
ments decrease by the number of galaxies in the cell, by the 
same logic used to write Eq. [9] For intermediate numbers of 
galaxies the value of diagonal elements of the VCF should 
be an improvement on using Applying this correction to 
just the auto correlations is reasonable since we find that 
they have the most power in constraining as. 



Another popular method of grouping galaxi es is a 
Friends-of- Friends algorithm ([Huchra &; Gellerlll982n . Aver- 
aging groups identified by the Friends-of-Friends algorithm 
would result in a large range of group volumes due to the 
decrease in galaxy sampling with distance, unobserved fields 
in the survey and masked out areas due to the galactic 
plane or stars. It would then be inappropriate to account 
for this smoothing with a window function of only one dis- 
tinct length scale or shape. Instead a much more complicated 
method would have to be found. The grid method of bin- 
ning allows the problem to be tackled in the opposite way: it 
imposes a simple and regular volume average onto the sur- 
vey, and therefore it is simple to account for the smoothing 
within the theoretical predictions. 

3.2 PCA method 

This method was dev eloped by Watkins et al . (2002) and 
iFeldman et al.l (|2003h in the context of peculiar velocity 
analysis and uses Karhunen-Loeve methods of data com - 
pression (iKendall fe Stuartll 19691 : Ik ennev fe Keepind[l954h . 
A set of moments is created from the radial velocities which 
are insensitive to small scales and therefore to the nonlineari- 
ties. These moments are then used in the likelihood analysis, 
see Section \2. 21 The moments m are linear combinations of 
the line of sight peculiar velocities Vj 

N 

m — ^ BijVj (14) 

where B s a constant N f x N matrix, N f is the number of 
moments retained and N is the original number of galaxy 
velocities in the survey, so N' < N. If N' is less than N there 
will be a loss of information, but by choosing the matrix B 
correctly this loss of information will be mostly associated 
with the velocity signal at small scales and therefore will 
tend to remove the effect of the nonlinearities. 

A simple model for the power spectrum is considered 
in which the power on nonlinear scales is proportional to 
a single parameter q . To be specific, we write the power 
spectrum as 

p(k) = p l (k) + e q p nl (k) (is) 

where P\(k) = for k > k n \ and P n \(k) = for k < k n \. To 
make the linear theory power spectrum P\(k) we run CAMB 
using the same cosmological parameters as used to simulate 
the observed velocities. Here k n \ is the wavenumber of the 
largest nonlinear density perturbation and we take this to be 
k n \ = 0.2/iMpc _1 . We take P n \(k) to be constant, P n \(k) = 
Po, over the range of nonlinear scales, k n \ < k < k c . This 
is not an accurate approximation to the nonlinear power 
spectrum however it has the benefit of greatly simplifying 
the calculations and essentially weighting all nonlinear scales 
equally. 

The rms velocity in the presence of nonlinear effects is 
larger than the diagonal elements of the velocity correlation 
function predicted by linear theory. We wr ite this difference 
as o~ ^ , as advocated in lWatkins et al.l (|2002h , and calculate it 
from noise-free nonlinear simulations. We use this difference 
to find a sensible value for Po , while setting the fiducial value 
of q to q = 1. The relation between velocity dispersion and 
the power spectrum is used to show that 
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Po 



(16) 



The choice of a maximum wavenumber k c reflects the fact 
that perturbations smaller than those which would form 
a galaxy will not contribute to a galaxy's velocity. It can 
be shown that the method is fairly insensitive to the exact 
choice of k c and k n \ (the effect is of the order of a percent 
on cr 8 ). 

If only a single moment is used as a data point then we 
can estimate the uncertainty on 6 q using the Fisher matrix 
formalism. The smaller the error on q , the more information 
is retained about small scales. By finding the weightings 
B n j of the velocities Vj which minimises the error on q 
we can find the single moment that carries the minimum 
information about small scales. This problem is solved by 
introducing a Lagrange multiplier, formulating an eigenvalue 
problem and solving it to find N orthogonal eigenvectors 
with corresponding eigenvalues, of the matrix M where 



Mi, 



1 SRij ! 
5 Atf ^ 



(17) 



and L is the Cholesky decomposition of the matrix R, R = 
L T L. The significant contribution of the uncertainties to the 
diagonal terms makes the matrix especially well suited for 
decomposition. The eigenvalue matrix, after sorting, can be 
shown to be equal to LB and thus B can be obtained after 
inverting L and applying it to the sorted eigenvalue matrix. 

Keeping moments up to some N' , throwing away the 
moments with the largest eigenvalues, will ensure the data 
most affected by the small scales is lost. We find the number 
of moments to use, N' using the Fisher matrix result 



A6 a 



f N' \ 



-1/2 



(18) 



4.1 Linear theory simulations 

Linear theory simulations were performed to check the ac- 
curacy of the approximate fix to the sampling problem. The 
galaxy positions were simulated to mimic the redshift dis- 
tribution of the SFI survey, as described in Section 14.41 We 
simulate galaxy velocities with the correlation function £ in 
Eq. [3] The number of galaxies in 6dFGS is so large that 
computation of the covariance matrix and eigenvalues was 
prohibitively slow. Therefore we verify the solution to the 
sampling problem on a smaller survey only. 



4.2 N-body simulations 

The ensemble of simulations use d in the curre nt work is a 
sub- set of the suite performed in lWarren et al. (|20Q6h i usini 
the Hashed-Oct-T ree code fHOT. ISalmon fe Warrenlll99- 
IWarren fc Salmon! Il992h a particle tree-code with periodic 
boundary conditions. More specifically, the self-gravity of 
the system of dark matter particles is evolved according to 



Npart. 

i 2 n 



dt 2 



Grrij 



n) 



r\ 2 +e 2 ) 3/2 
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+ H 2 (z)Q A n, (20) 



where n and mi are the position and the mass of the i-ih 
particle respectively. The Hubble Constant for a flat ACDM 
model at some redshift z is H 2 (z) = Ho [Q m (l + z) 3 + Ha] , 
with Ho = 100/ikms _1 Mpc _1 and ^a is the dimensionless 
cosmological constant at the present epoch. The second term 
on the r.h.s expresses an acceleration with respect to posi- 
tion of the z-th particle in the presence of the nonvanishing 
cosmological constant. The initial conditions were set up for 
a flat geometry universe with parameters as stated as at the 
start of this section. 



where A n are the eigenvalues. Since our true value of q 
is q = 1 we desire to have enough modes so we cannot 
distinguis h a from zero . We require this at la confidence 
following IWatkins et all <|2002h , thus A0 q = 1. 

The moments u% are statistically uncorrelated by de- 
sign, if we can assume that the peculiar velocities Vj are 
Gaussian random variables. This means information con- 
tained by the moments th rown away will be c ompletely re- 
moved from the data. See Wat k ins et al.l (|2002h for full de- 
tails. The likelihood function then becomes 



/ N' 

exp --^UiCRT^ijUj 



(19) 



where R BRB" 1 . 



4 MOCK CATALOGUES 

We test the methods for peculiar velocity analysis described 
in this paper on simulations. The cosmological parameters 
used in the simulations are as follows: as = 0.9, Q m =0.3 
where Q b 0.04, Q A 0.7, H 70kms _1 Mpc _1 and n s 
1. Two types of simulation were used, a simple linear theory 
simulation and a more realistic nonlinear N-body simulation. 



4.3 Making Mock Catalogues 

To extract a generic 6dFGS mock catalogues we used three 
1012 3 particle simulations in ^ ox = 384Mpc/i _1 boxes. 
The mass of the particles is 4.39 x 10 9 M© and the softening 
length 12.3kpc/i _1 . The following prescription to draw the 
catalogues from the above simulation was implemented: 

• U sing a Friends-of-Friends (FOF) method ([Frenk et al.l 
1988) we identified agglomerates of dark matter particles. 
To be a bona-fide halo e ach agglomerate sho uld have a min- 
imum of 400 particles (| Warren et al.l 120061 ) within an iso- 
density surface generated by a linking parameter b ~ 0.2 
times the average interparticle separation in the simulation. 
The halo velocity is the average velocity of the dark matter 
particles within a halo; 

• We split each of the three simulation boxes into eight 
sub-volumes of roughly the same size. To mimic the 6dFGS 
geometry (the southern sky sampled within w 170/i _1 Mpc ) 
in each sub-volume we implemented the following iterative 
sequence: 

- we chose three orthogonal directions and a halo ran- 
domly; 

- we carved a 6dFGS volume within the considered 
sub- volume having the chosen halo as an "observer" and 
the random orthogonal directions as the mock system of 



© 0000 RAS, MNRAS 000, 000-000 



Peculiar Velocities into the Next Generation 7 



-6dFGS sample 

- average mock catalogue 



tains 1289 galaxies. The locations of the galaxies were sim- 
ulated by picking 1289 galaxies to follow the SFI n(z) from 
the mock catalogues described in Section 14.21 above. The 
angular distribution of the galaxies was selected at random 
from the full survey area of sky of the 6dFGS mock cata- 
logues (roughly a hemisphere). The same galaxy positions 
were used for both the linear and the nonlinear simulations. 
This was to ensure a similar sampling for the positions of 
the galaxies in the linear simulation to the nonlinear simula- 
tion. This is important because we are testing the accuracy 
of our fix to the sampling problem, which is related to the 
distribution of galaxies. 



o> ' 1 1 ' ' 1 

0.01 0.02 0.03 0.04 0.05 0.06 

redshift 

Figure 2. A comparison between the n(z) of the mock catalogues 
and the n(z) of the galaxies in the velocity sample of the 6dFGS 
survey. The black (dark) line with the square points is the n(z) 
from a sample which the final 6dFGS sample will be drawn. The 
red (light) line is the average n(z) of the 21 mock catalogues. 
Both have been normalised by the number of galaxies the samples 
contain. 

coordinates. We returned to the previous step if this pro- 
cedure was not possible; 

- we computed the velocity field components to take 
into account the rotation from the simulation's frame to 
the new frame defined by the three random orthogonal 
directions. We went back and started the sequence on a 
new sub- volume. 

On average, the number density of halos in the simulations 
is ~ 0.0043 /i 3 Mpc~ 3 while the 6 dFGS number de nsity is 
10 -2.i26 / l 3 Mpc -s in the K _ hand (j Jones et alJl2006h . How- 
ever, the total number of 6dFGS galaxies with measured 
distance is roughly a tenth of the total; 

• A Monte Carlo rejection was used to choose halos ac- 
cording to the 6dFGS observed radial distribution (Camp- 
bell et al, in preparation). 

Each mock contains the real space positions and veloc- 
ities. Henceforth, in our analysis we will use 21 mocks. For 
the considered box size, L^ Q ^. — 384Mpc/i _1 , one can draw 
19.4 independent mocks: 6dFRS covers the Southern sky for 
|6| > 10° out to a distance of 170Mpc/i _1 . Thus the error 
bars calculated from the 21 mock catalogues are taken ac- 
count of accordingly. For more details regarding the 6dFGS 
mock catalogues see Teodoro et al (in preparation). Figure 
[2] shows a comparison between the n(z) of the mock cata- 
logues and the n(z) of the galaxies in the velocity sample of 
the 6dFGS survey. 

4.4 SFI redshift distribution 

Due to computing limitations performing the full likelihood 
analysis using all the individual galaxies can only be done for 
around one thousand galaxies. To show the effect of binning 
compared to no binning we therefore mimic the galaxy dis- 
tribution of an older survey which is smaller than the 6dFGS 
survey we are primarily interested in. For this purpose we 
roughly recreate the n(z) from the SFI survey, which con- 



4.5 Velocity errors 

To measure the radial peculiar velocity of a galaxy, the red- 
shift and distance to that galaxy first need to be measured. 
The velocity can then be found from v pec = cz — d. It is 
the measurement of the distance d that is so difficult be- 
cause the scaling relations between galaxy properties that 
are used to estimate the distance have a very large scat- 
ter. They are also s ubject to a number of biases, of which 
Malmquist bias (see IStrauss fe Wil lick 1995, for a review) is 
the most difficult to correct. The effect of Malmquist bias is 
also to over-estimate as and Qm which is the same direction 
as the effect of nonlinearities. Since the aim of this paper is 
to focus on quantifying how well the analysis removes the 
effect of the nonlinear signal, we take the idealised situation 
that we have correct the Malmquist bias perfectly. To repli- 
cate the effect of the scatter in the distance indicators we 
added a Gaussian random error of width 20% of the galaxy 
distance to ea ch peculiar velocity. T he same procedure was 
carried out in Feldman et al . (2003) when testing their op- 
timal moments method described above. 

This idealistic treatment of the velocity errors is not 
unreasonable because Malmquist bias can in principle be 
corrected if the line of sight density distribution is known. 
To correct for Malmquist bias in the real world usually the 
galaxy density distribution, or n(r), from a different galaxy 
survey is utilised. The correction only works if the galaxy 
survey used to compute the Malmquist bias corrections has 
the same underlying n(r) as the peculiar velocity survey. 
To mimic this situation for the analysis methods in this pa- 
per another galaxy distribution would have to be created 
from the mock catalogues to be used for the Malmquist bias 
correction and therefore the success of the correction would 
depend on the properties of this simulation, which is beyond 
the scope of this paper. 



5 RESULTS 

Here we present our results from using the methods de- 
scribed in Section [3] The aim is to show the ability of the 
methods to constrain as so that the result is not biased by 
nonlinearities. 

5.1 Results from the Gridding Method 

We applied the method in three different situations (see Sec- 
tion |4} : to a linear simulation of a small survey, a nonlinear 
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Figure 3. The green cross and green circle points are the results 
uncorrected for the sampling problem with and without uncer- 
tainties respectively. The blue cross and blue circle points are 
the results corrected for the sampling problem using equation I13I 
again with and without uncertainties respectively. The black tri- 
angle point in the top and middle panel is the result from doing 
the analysis with all the individual galaxies. The error bars in the 
middle and bottom panels are the mean la error on crs from each 
mock divided by the square root of the number of independent 
mocks (19). Top panel: Linear simulation with 1289 galaxies. The 
galaxy positions were selected from the 6dF mock catalogues to 
closely resemble the n(z) of the SFI survey. Middle panel: Non- 
linear simulation with 1289 galaxies. Galaxies were selected from 
the 6dF mock catalogues to closely resemble the n(z) of the SFI 
survey. Bottom panel: Nonlinear simulation with 13000 galaxies 
selected from the 6dF mock catalogues. The white area depicts 
clearly the expected constraint on as from a bin size of 20Mpc/h, 
since the errorbars shown are divided by the square root of the 
number of independent mock catalogues 



simulation of a small survey and a nonlinear simulation of 
the 6dFGS survey. The results are shown in Figure [3] 

The top panel of Figure [3] shows the results from using 
a linear theory simulation of a survey with 1289 galaxies 
and a SFI-like n(z). All cosmological parameters are kept 
fixed except for as , for which we calculate 68 per cent confi- 
dence limits. We take the galaxy positions from an N-body 
mock catalogue, to incorporate the galaxy clustering, and 
perform 100 velocity noise and galaxy position realisations. 
The points shown are the average values of as across these. 
The first point, at a bin size of zero, is the result of a stan- 
dard likelihood analysis as described in Section [2] using all 
the galaxies separately. As expected, the point agrees well 
with the fiducial model. 

The light (green) circles show results from noise-free 
simulations in which we have binned the velocity data but 
treated these binned velocities as separate independent trac- 
ers of the point velocity field in the theoretical predictions, 
i.e. we have used Eg. 1111 and 1121 instead of the approximate 
correction for the sampling problem on the diagonal ele- 
ments, given by Eq. [13] We see that the results are strongly 
biased, since there are some cells containing only a very 
small number of galaxies. In these cells the velocity field has 
not been smoothed considerably, yet the theoretical predic- 
tion in Eq. [12] has been smoothed, so as has to be large to 
fit the data. The results are shown for different bin sizes, 
and we see that as the bin size is increased the smoothed 
predictions fEa. I12[) work increasingly well, as expected. At 
small bin sizes we also expect the results to be more accu- 
rate because the data and theory are both smoothed very 
little. 

The dark (blue) circles show results from the same 
noise-free data, in which we now use the sampling prob- 
lem correction given in Eq. 1131 The fact that these are so 
close to the fiducial model is not trivial, since the correction 
is an approximate but practical one. The error bars are the 
survey errors divided by the square root of the number of 
realisations. Presenting the errors in this way does not rep- 
resent the expected error from 6dFGS, but it shows whether 
or not ag is biased significantly. The points agree with the 
fiducial within the error bars. It would be necessary to per- 
form more simulations to test the accuracy of the sampling 
problem correction further, and ultimately we expect it to 
break down. These simulations tell us that the correction 
works to better than 2.5 percent on as, which is sufficient 
for the near future velocity data. 

The crosses show results from simulations in which noise 
is added to each velocity at the 20 per cent level. The light 
(green) crosses are the results when the sampling problem 
correction is not implemented. Whereas the lack of this cor- 
rection caused a very large bias on as for the noiseless veloc- 
ity simulation, we see that the bias is not significant when 
a realistic noise level is used. The dark (blue) crosses corre- 
spond to realistic noise simulations with the sampling prob- 
lem correction implemented. These points are very slightly 
lower than those without the correction included (note that 
the same noise realisations were used for each so this rela- 
tive difference is significant despite the size of the error bars) . 
But the difference is small. Since there is so little difference 
between the blue and green cross points we conclude that 
doing an even more accurate window function involving the 
exact positions of every galaxy in each grid cell would be un- 



© 0000 RAS, MNRAS 000, 000-000 



Peculiar Velocities into the Next Generation 9 



necessary as it would offer negligible improvement. Finally 
because the fiducial is recovered beautifully by all but the 
uncorrected no noise points we find that the approximation 
mentioned in Section \3. 11 that the radial components of all 
galaxies in each grid cell are parallel, is good enough. 

The middle panel of Fig. [3] shows the effect of nonlinear 
theory, serving as a direct comparison with the top panel. 
The same survey parameters are used as for the top panel, 
and the only difference is the use of the nonlinear theory 
for the simulated noise- free velocities. The mock catalogues 
described in Section [4] are used, a total of 21 different cata- 
logues with the same fiducial cosmology. Again the point at 
zero bin size corresponds to the traditional method in which 
all galaxies are treated independently. This is expected to 
be biased by nonlinear theory, and indeed we see that the 
difference from the fiducial model is significant and around 
10 percent. This is not a large bias, but we have not matched 
the SFI sky coverage in our simulation, which would have 
been smaller and thus more susceptible to nonlinearities. 

The remaining points correspond directly to their coun- 
terparts in the top panel. The biases on the noise-free points 
are larger than before, especially at small bin sizes. Even for 
the corrected noise- free points (dark/blue circles) there is 
a disagreement which persists even to large bin sizes. The 
distribution of galaxies in the top panel of Figure [3] exactly 
matches the distribution in the middle panel. The differ- 
ence between the top and middle panels is just the peculiar 
velocity values themselves. In the top panel the peculiar ve- 
locities are derived from a linear theory simulation and in 
the middle panel they are derived from a nonlinear N-body 
simulation. As can be seen from the top panel the fiducial as 
is recovered precisely for all bin sizes and all points except 
the wholly unrealistic uncorrected no noise points. It should 
be noted that the no noise points with the sampling cor- 
rection do recover the fiducial. This shows that the method 
by itself does not cause any systematic uncertainties on the 
as result, and shot noise and velocity errors are properly 
accounted for. 

The differences between the top and middle panels of 
Figure [3] are therefore entirely due to the realities of the N- 
body simulation including nonlinearities (expected on small 
scales) and cosmic variance (expected on large scales) . Note 
that the cosmic variance effects will be larger for the SFI 
mock catalogues than for the 6dFGS catalogue since the 
number of selectable particles available in the finite num- 
ber of simulations is smaller. This cosmic variance will be 
different for the noisy and noise free points because they 
sample different parts of the simulations (the noisy points 
effectively sample the very low redshift parts of the simu- 
lation only, whereas the no-noise points also sample higher 
redshift points) . We do not address this further in this paper 
since our goal is to assess the contamination of nonlineari- 
ties in 6dFGS-like data. To investigate these cosmic variance 
effects further is beyond the scope of this paper particularly 
since it would require a larger suite of simulations. However 
this suggests that if a more accurate distance indicator, such 
as supernovae, were used in a survey with a similar geometry 
then this issue may need to be revisited. 

The crosses in the middle panel of Fig. [3] show the re- 
sults of the small SFI-like simulation with a realistic noise 
level added to the N-body simulation velocities. The noise 
washes out the nonlinearities to a large extent, as seen by 



comparing points at a small bin size of 5 hT 1 Mpc, where the 
no-noise (corrected) dark (blue) circle is much higher than 
the points with a realistic noise level. The nonlinearities are 
expected to affect galaxies at small separations the most. 
The most extreme case of small separations is the velocity 
autocorrelation, i.e. the diagonal elements of the covariance 
matrix. The noise essentially blankets the predicted diag- 
onals and therefore covers up biases due to nonlinear the- 
ory. Again there is little difference between the two sets of 
crosses, implying that the details of the sampling problem 
correction will not be extremely important on data with 20 
per cent distance errors. At large bin sizes the noisy points 
lie below the fiducial. We attribute this to cosmic variance 
from the finite number of mock catalogues. Since it occurs 
when the bin size is 20/i _1 Mpc or more, a volume which is 
at least a quarter of the size of the small survey, naturally 
the cosmic variance will be large. We are unlikely to want 
to average over such large volumes in practice because we 
would loose too much information, therefore we do not ob- 
tain more mock catalogues to improve the result. The bias 
appears to have been removed for the point at 10 hT 1 Mpc, 
which is one-eighth of the survey size. 

The bottom panel of Fig. [3] shows the results from us- 
ing a nonlinear simulation of the 6dFGS survey with 13000 
galaxies. It is not possible computationally to do the analy- 
sis with a bin size less than 10/i _1 Mpc, and at these scales 
the a s result appears to be biased high at 68 per cent con- 
fidence. A bin size of 20/i _1 Mpc appears optimum as there 
is a trade off between decreasing the constraint on as and 
overcoming the systematic bias due to the nonlinearities. 
The resulting expected constraint on as from this bin size 
is depicted by the white area on the plot since the errorbars 
shown are divided by the square root of the number of mock 
catalogues. At this bin size the bias on as from nonlinear- 
ities is less than 3 percent. This is about one- fifth of the 
size of the error expected on as from 6dFGS at this bin size 
and therefore we conclude that we have devised a method 
suitable for removing the nonlinearities in 6dFGS. 

5.2 Results from the PCA method 

We applied the PCA method outlined in Section \3. 2 1 to two 
different survey types (see Section 2}: to a nonlinear simu- 
lation of a small survey with a SFI-like galaxy distribution 
and a nonlinear simulation of the 6dFGS survey. We show 
results after averaging over all of the 21 mock catalogues. 

The top panel of Figure |4] shows the results from using 
a nonlinear simulation of 1289 galaxies with a SFI-like n(z), 
with a realistic noise level. No binning is done to the data. 
The x-axis shows the number of number of PCA moments 
removed. Note that the point at zero moments removed is 
exactly equivalent to the point at bin size zero on the mid- 
dle panel of Figure [3] As expected, as more moments are re- 
moved the bias caused by nonlinear theory is reduced. The 
uncertainties also increase, as expected. The bias to low as 
values when a large amount of smaller scale information is 
removed is consistent with our cosmic variance explanation 
of the low points at large bin sizes for the middle panel of 
Fig. El 

The vertical black dotted line shows the recommended 
number of moments to remove found from Eq. 1181 and this 
number is about 74. However, the bias has not reduced sig- 
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Figure 4. Top panel: The nonlinear simulation of 1289 galaxies 
with a SFI-like n(z), just the points with measurement errors are 
shown. The vertical black dotted line shows the recommended 
number of moments to remove found from Eq. 1181 Bottom panel: 
The nonlinear simulation of the 6dFGS survey. The points shown 
are with (dark/blue triangles) and without (light /green squares) 
the sampling problem correction applied. The horizontal solid line 
in both plots is the fiducial as- The error bars in both panels are 
the mean la error on as from each mock divided by the square 
root of the number of independent mocks (19). 



nificantly at this point. It seems that a greater number of 
moments should be removed than recommended by Eq. [18] 
When the bias on as disappears (at about 800 moments re- 
moved) the error bar on as divided by the square root of the 
number of mocks is about 0.05. Comparing this to the re- 
sult for the smaller survey from the gridding method at the 
optimum bin size of 10/i _1 Mpc (see middle panel Figure [3}, 
the equivalent error is 0.03. 

The bottom panel of Figure [4] shows the results from us- 
ing a nonlinear simulation of the 6dFGS survey with 13000 
galaxies. Since we expect around 10,000 to 13,000 peculiar 
velocities from 6dFGS we cannot do the PCA analysis de- 
scribed above on the individual galaxy velocities. This would 
mean doing calculations with at least 10,000x10,000 matri- 
ces which is too computationally intensive. Instead for the 
nonlinear simulation of the 6dFGS survey we apply the PCA 
analysis to the data after it is binned with a cell size of 



20/i _1 Mpc using the gridding method (see bottom panel of 
Figure [3} . 

Results are shown with (dark/blue triangle points) and 
without (light/green square points) the sampling problem 
correction applied. This time the x-axis is percentage in- 
stead of the number of moments removed. This is because 
after binning the data the total number of cells containing 
data varies with the original galaxy distribution. Since the 
total number of cells varies between mock catalogues and re- 
alisations of mock catalogues to be sure of being consistent 
between realisations we instead remove the same percentage 
of moments each time. This time Eq. [18] recommends that 
no moments need to be removed and this is borne out by the 
result in the bottom panel of Figure 2] The PCA analysis in 
this particular case shows no improvement over the gridding 
method. 

5.3 Forecast for the 6dFGS survey 

From the nonlinear simulation of the 6dFGS survey we 
present 1 and 2a likelihood contours in the Q m -<78 plane 
using the gridding method with a bin size of 20/i _1 Mpc. 
The resulting contours in Figure [5] are after averaging over 
all of the 21 mock catalogues. All cosmological parameters 
except for Q m and as were kept fixed. The thick contours are 
for the equivalent of the dark/blue cross point at 20/i _1 Mpc 
in Figure [3] (bottom panel), so the binning has been ac- 
counted for by using Eq. 1131 The fiducial Q m and as for the 
mock catalogues are indicated by the black triangle. After 
marginalising; over £lrn we find as — 0.78 =b 0.14. 

The thin contours show the result for a peculiar veloc- 
ity survey with an uncertainty that matches that expected 
from supernovae, of 5 percent error on the distance indica- 
tor, instead of the 20 percent assumed in the rest of this 
paper. The same 21 6dFGS mock catalogues are used ex- 
cept now the distance error has been reduced. This is not 
a representation of a realistic future supernova survey since 
the sky and redshift coverage of such a survey would be 
much larger. The resulting contours just show the increase 
in statistical power when using a distance indicator with the 
same observational error as supernovae. The constraints on 
as at the fiducial Q m are a factor of two tighter from the 
supernovae than the galaxies. This is despite the factor of 
four decrease in velocity error, which implies that the cosmic 
variance limit is being reached. After marginalising over £lrn 
we find a 8 = 0.84 ± 0.08. 

The contours have been overlaid on the joint con- 
strai nts on as and l] m from the 100 deg 2 weak lensing sur- 
vey (Ben jamin et al.|[2007f ) assuming a flat ACDM cosmol- 
ogy and adopting the nonlinear matter power spectrum of 
ISmith et all {2003). The weak lensing contours depict the 
0.68, 0.95, and 0.99% confidence levels. The models are 
marginalised, over h = 0.72 ± 0.08, shear calibration bias, 
and the uncertainty in the redshift distribution. 

As can be seen from Figure [5] the shape of the contours 
from the two different probes is different. This is because 
weak lensing and peculiar velocities probe the power spec- 
trum differently. The weak lensing contours follow lines of 
constant as^^ 68 , the normalisation of the power spectrum, 
while the contours from peculiar velocities are more indepen- 
dent of the value of Q m • appears in the velocity analysis 
both in the normalisation of the velocity signal and in the 
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Figure 5. The thick contours are the 1 and 2a contours for the 
gridding method with a bin size of 20h~ 1 Mpc when the sampling 
problem is corrected for with Eq. 1131 The thin contours are the 
same as the thick contours except repeated for a 5% distance er- 
ror, representing results from a supernova peculiar velocity survey. 
The fiducial f2 m and erg for the mock catalogues are indicated by 
the black triangle. The contours have been overlaid on the joint 
constraints on as and fl m from the 100 deg 2 weak lensing survey 
(grey scale contours) assuming a flat ACDM cosmology, depicting 
the 0.68, 0.95, and 0.99% confidence levels. 



shape of the matter power spectrum. The two effects par- 
tially cancel out to produce the flatter contours seen in the 
figure. The peculiar velocity contours close at low Q m but 
are more open at higher f2 m . 



6 CONCLUSIONS 

In this paper we have demonstrated a new method, the grid- 
ding method, for the analysis of the 6dFGS survey that re- 
moves the contribution to the peculiar velocities from the 
nonlinear scales. The method uses a grid to average over 
galaxies in the same volume of space, then performs the 
usual likelihood analysis with the averaging taken account 
for by multiplying the power spectrum in ^ with a window 
function describing the shape of the grid cells. 

We find a practical approach to taking into account the 
different effective amounts of smoothing in each grid cell, 
due to the different number of galaxies in each cell. We have 
shown that this works for linear theory simulations with 
and without noise. We find that it works well for nonlinear 
theory simulations for realistic amounts of noise. We find for 
a survey such as 6dFGS we can estimate as with less than 
3% bias from nonlinearities, acceptable within the expected 
error bars. 

We have com pared our approach w ith t he optimal mo- 
ment s method of Wat kins et all (|2002h and iFeldman et al.l 
( 2003) which uses a principal component analysis to select 
a set of optimal moments constructed from linear combina- 
tions of the peculiar velocities that are least sensitive to the 
nonlinear scales. We conclude that for our mock catalogues 
of the 6dFGS survey averaging in grid cells performs simi- 
larly well as compared with the PCA analysis. The error bar 
of the apparently unbiased point is a similar size for the PCA 



analysis as for the gridding method. We have shown PCA is 
an excellent technique which advantageously uses data com- 
pression to successfully remove the nonlinear signal in the 
velocities. However at some size of data set PCA cannot be 
applied to the whole survey, which could be overcome by di- 
viding the survey up into smaller pieces because of the large 
matrices involved. This could be alleviated by performing 
the PCA analysis on each piece individually before combin- 
ing the results although this looses some information on the 
largest scales in the survey, which are most accurately de- 
scribed by linear theory. Gridding is an alternative, though 
less elegant, data compression technique which retains the 
large scale information when dealing with large surveys. 

In order to show the effect of the methods on removing 
biases from nonlinearities we did not consider other biases 
inherent in peculiar velocity analysis. In particular this pa- 
per did not attempt to deal with the minefield of Malmquist 
bias, arguably the next most difficult bias to correct; this is- 
sue will be the subject of future work. 

For the gridding method using a bin size of 20/T 1 Mpc 
we presented the 1 and 2a likelihood contours for Qm-as 
to show the constraint expected from 6dFGS. The expected 
error on as after marginalising over Q m is approximately 
16 percent. We also showed the constraints expected from 
a supernovae peculiar velocity survey the same size and sky 
coverage as the 6dFGS survey, this time the expected error 
on a 8 is 0.08 after marginalising Over £lrn • A survey with this 
number of supernovae could po tentially be performed us - 
ing data from the GAIA mission (jBelokurov fe Evandl2003h . 
which expects approximately 14 000 local Type la super- 
novae out to a redshift of 0.14 over the whole sky. The dif- 
ferent degeneracy direction in the as-Qm plane will be use- 
ful in breaking degeneracies, see also lGordon. Land &; Slosarl 
([200 /t ). Another survey comm encing at the tim e of writing 
is the Sky Mapper Telescope ([Keller et al.ll2007l ) which aims 
to perform a multi-band survey of the southern sky and ex- 
pects to discover 100 Type la supernovae per year out to a 
redshift of 0.085. Furthermore, since velocities can probe as 
at very low redshift then they can in principle be combined 
with other probes to measure the growth of structure and 
thus properties of dark energy or modified gravity. 

The constraints on cosmological parameters from 
6dFGS will be a helpful addition to existing data, and we 
see that the accuracy is not greatly improved in using a 
more accurate distance indicator such as Type la super- 
novae. We conclude that the limitation is largely due to the 
small cosmic volume probed. Obtaining distance estimates 
to objects in a large volume is very difficult. The kinetic 
Sunyaev-Zerd ovich effect in clusters of g alaxies is one pos- 
sible method (|Sunvaev &; Zeldov ich 1980) although this is a 
difficult challenge observa tionally ([Knox, Holder, Church! 
2004; iDiaferio et al.ll2005r ). The current interest in large su- 
pernova surveys could potentially yield a useful data set, 
since the velocity error per unit area of sky is constant with 
distance. 

The peculiar velocity data from the 6dFGS survey can 
be used to estimate not just as but also the shape of the mat- 
ter power spectrum. This can be compared to results from 
the Two degree F ield Galaxy Redshift Survey (2dFGRS, 



trie Two degree F ield Ualaxy Kedshiit Survey (^dFUKS, 
Colless et al]j 2001) and Sloan Digital Sky Survey (SDSS, 



York et al.ll2000h which will help to constrain galaxy bias- 



ing. 
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